RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
_ Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number 



Source: 



Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 c-mai! help: Datin21hcln@usnto.20v or phone 703-306-41 19 (R. Wax) 
PATENTIN 3.0 e-mail help: natin31ieln@tisnto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 (TFR §§1.821 - 1.825 effective October I, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 
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01 l<K,(e 



Checker Version 3.0 can be down loaded from the USPTO website at the following add 

http://www.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



error DFTrcrrn SUCCESTED CORRECTION SERIAL NUMBER; Wfafy Lfty 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA* HEADERS. WHICH WERE INSERTED By „ 

1 



_ Wrapped Nucleic? 
Wrapped Aminos 



The numberAe* at the : end of etch line "wrapped- down to the next |^«. JVu may ^ irvouf - . 



2 - 

3 



>VtW ^ *"* tlut ' K ~ ««"« 72 character, in Ic'ngih. fei include, white .pacer 



_MU*ligncd Amino 
Numbering 



< Non-ASCII 

3 L Variable Length. 



_Pa(cntIn 2.0 
"bug" 



^_ Skipped Sequences 

(OLD RULES) 



* Skipped Sequences' 

(NEW RULES) 



\ Use ofn'i Of Xaa'i 

(NEW RULES) 

5. Invalid <2 1 3> 

Response 




of<220> 



Palentln 2.0 

"bug" 



.^Misuse of n 



The .ubmitted file was not aaved in ASCII(DOS) text. ., required by the Sequence Rule, New, 
ensure your,u.K,u< n < .ubrnlulon U a.rtd In ASCII Kit. 

. * » - , 

Sequence<0 contain n'« or Xaa'. representing more IhVone residue. Per Sequence Rule. 

each » or X.. can .nlyrepreaent . .Ingle residue. Please present the maximum number of e'.ch 
res.due h. vmg vanable lenglh«d indicate in the <2 20>.<223> .ection thai ,ome miyuJs^ 

A "but" in P.tentln version 2.0"ri c^ed flic <220>«323>«ection to be mining from aml - n0 tcid 

.equ«nce,<.) . Normally. P.tentln wouldwlonutieaKy generate this i^Trom £f 

pr.v.^.lycoded^c.eie.cid.eouence. ri«««U*Ul y « w £&^5^2S^ 
Artificial or Unknown Mquencca. 



Scqucncc<i) 



milJ,n S- If intentional, please iiuert the fotfowin* line, for r.rh ^ . 

m INFORMATION FOR SEQ ID NO* (tao. SEQ ID NO W.'-TJ^S *""""" 

This sequence is intentionally skipped nj 

Plea,, also adjust the "(ii) NUMBER OF SEQUENCES." response to Include the skipped sequence, 

Sg^L^St? ' f ,n,Cn " 0na, • * f °" 6Wine lin " "* Jk "" 5ed 

«00> sequence id number 

000 

Use of n's and/or Xaa's have been detected in the Sequence Listing 

£T< M0> ^0??^* U, "-. USe of< ". 0>< " 3> » MANDATORY if n", or X.a's are present 

In <220> to <223> secuon. please expla.n location ofn or X... and which residue „ or X.. represent, 

Per I 123 ofSequence Rules, the only valid <2 13> responses are: Unknown. Artif.ci.l Sequence or 

LrSaTsTq^r^' 0 ^^*^^^y>^!^^ 

, c ie " e " puin ,our " orgent,ie m ' 1 ™ 1 « <mo> q 

(S«, Fedcra, Rcg,stc,-06A>.,.99S. Vol. 63. No. 104. pp. 2963.-32) (See. 1.823 ofSequence Rule,) 

Please do not use "Copy to Disk" function of P.tentln version 2.0. This cause, a corrupted file 
esuttmg ,n rn,,,,ng mandatory numeric identifier, and response, (u indicated on r^eque^e 
lufang). Instead, please use "File M.nager" or any other manual mean, to copy f.le to Poppy disk. 

'^a£^ 

AMC/MH - Biotechnology Syrtcnu D ranch -08/21/2001 
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RAW SEQUENCE LISTING DATE : 10/16/2001 

PATENT APPLICATION: US/09/966,422 TIME: 15:52:20 

Input Set : A:\30534111.app 
Output Set: N:\CRF3\10162001\I966422.raw 

201 gccaaatatt aatgtattta ttagtttatc atatttaatt tttataacac cataaatagg 720 

202 tattaatgta cacattttat agatgaggaa aatgtggttc tgagaggtga agcattttgc 780 

203 ctagtgatca cagctaaaaa gtgatagagc tgttctttat tttaaagttc acattgtact 840 

204 accctggctc cctaatcaca gatgggcagg gtaggggttg ggtggggaca gaagttggag 900 

205 agtggatgtg gctgccaacc acacaagttg tgccaaccca cagattgagg aaagatgcta 960 

206 aatttggaat ctggcaaacc agtgtttggt tcttagctct gccacttcta agctgtgtga 1020 

207 aacttggttg aggtccctaa cttctcctga gggtgaacaa ctcacaaagt tgttttgctt 1080 

208 attaaatgtg ataacacctg taaacatcta acagagtgcc tagcacatag cagggatcta 1140 

209 gcaattgaat tagggttatt tgtttctgtc tactgattgg gtattgtttc tgacacttac 1200 

210 ccaagtgtga atagcctata acactggtat aatttgtgaa atgatgctgc catctagtga 1260 

211 aaaccaagac acacacacac acacacacac acacacacat acacacacac gtgcgcgcgc 1320 

212 atggacaccc agcttcacca atgacaatat ggattggcat gttttagcct cacaacacag 1380 

213 agccctgggg ctaactggca cctagagagg tcatctcggc cagtgccttc caaactacca 1440 

214 gtgctgaaaa gccagttcaa aaaattttga acccattgca caccaatatt tttgtgaaat 1500 
1,1 a ^ataaaaa taaattactg gaaaaatgaa ataaaaaata tgtataaaat acaaaccaaa 1560 

216 attttagaac tgttagattc aacagcaaaa aattgctgta tacatctctg accaattgct 1620 

217 ttcagtttct gtgcttatct ctctacgacc tttgtaacac acagtgaacc agcgctggcc 16 80 

218 catggataca ctctagtagc cccaatctag ctaaggcagc cccttatagt taatcaatcc 1740 

219 tgtcaaacag gaaaggctgg caaaaccact ggtctgcatg tactttgtcc tttacacaag 1800 

220 gaaggatgca aacgtggaaa actgagtgga catggtgttc aggagattga ggctcagcta 1860 

221 aattccagct tatttacctg cagttgctta caaagtgttt ggacataatt gtgtaaagct 1920 

222 agggtttttt ttctggtttt taaaacaggt aaaggatgtc acagcaccac ttaataacat 1980 

223 ttcttctgaa gtccagattt taacatctga tgccaataaa ttaactgctg agaacatcac 2040 

224 tagtgctacg cgagtggttg gacagatatt caacacttcc agaaatgctt cacctgaggc 2100 

225 aaagaaagtt gccatagtaa cagtgagtca actcctagat gccagtgaag atgcttttca 2160 

226 aagagttgct gctactgcta atgatgatgc ccttacaacg cttattgagc aa 2212 

229 <210> SEQ ID NO : 4 

230 <211> LENGTH: 449 

231 <212> TYPE: DNA 

232 <213> ORGANISM: Homo sapiens 
234 <220> FEATURE: 
2 36 <4 00> SEQUENCE: 4 

237 acagtaaaac ttacctgttg tggtcttttt aatcacctcg tttgagtttt atctgtttct 6 0 

238 ctcctttatt tcccagtcct ctcagaaagt cttcctcaat gtattttgct caggattaag 120 
III ffttagataa aacctgttgt ttattattat tcggcataat ggacttggta gtttttctat 180 
J!? tl tT a ll q attt 9 tactt 9-aataaggtg aagaatttca cacaacatac aagagtacca 240 

241 ttgttcctta tatcgttaaa tctttgtgac acactttgac aaaaatgtag aacctataac 300 

242 aaattctttt acaagttact ataaaggaca caaagagaaa actttacctt ccagaacaaa 360 

243 atgactcctg atgaacagtg tgtggggatt tgcttgtatg tattaaactt ttgacctctg 420 

244 aaaaaaaaaa aaaaaaaaaa aaaaaaaag AAQ 

247 <210> SEQ ID NO: 5 

248 <211> LENGTH: 80 
24 9 <212> TYPE: DNA 

250 <213> ORGANISM: Artificial Sequence 

252 <220> FEATURE: 

253 <223> OTHER INFORMATION: Description of Artificial Sequence : Synthetici^ 
2 54 oligos 

256 <400> SEQUENCE: 5 

257 gctgtgcagc gctgagtgcg ttccaggtaa atgtcactaa cagaaaatag tgcagtaagg 60 
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OIPE 



RAW SEQUENCE LISTING DATE: 10/16/2001 

PATENT APPLICATION: US/09/966,422 TIME: 15:52:20 

Input Set : A:\30534111.app 

Output Set: N:\CRF3\10162001\I966422.raw 

3 <110> APPLICANT: Feder, J. N. 

4 - Mintier , G . 

5 Ramanathan, C. S. 

6 Hawken, D. R. 

7 Cacace, A. 

8 Barber, L. 

9 Kornacker, M. G. 

11 <120> TITLE OF INVENTION: A ' NOVEL HUMAN G-PROTEIN COUPLED RECEPTOR, HGPRBMY6 , 

12 EXPRESSED HIGHLY IN SMALL INTESTINE 
14 <130> FILE REFERENCE: D004 ONP/3053 - 4119US3 

C--> 16 <140> CURRENT APPLICATION NUMBER: US/09/966,422 d\( D08S Not Comply 

17 <i4i> current filing date: 2001-09-26 Corrected Diskette Heeded 

19 <150> PRIOR APPLICATION NUMBER: 60/235,602 

20 <151> PRIOR FILING DATE: 2000-09-27 f 

22 <150> PRIOR APPLICATION NUMBER: 60/306, 604 \fCT^ -) 

23 <151> PRIOR FILING DATE: 2001-07-19 <-v ^ y 

2 5 <150> PRIOR APPLICATION NUMBER: 60/315,412 & rrcn^ J**** y~ ^ 

26 <151> PRIOR FILING DATE: 2001-08-28 ^Ux (, * £ 7 A" 

28 <160> NUMBER OF SEQ ID NOS : 81 TrT • 

30 <170> SOFTWARE: Patentln Ver . 2.1 \ 1 > 

32 <210> SEQ ID NO: 1 t 

33 <211> LENGTH: 1683 

34 <212> TYPE: DNA ' 

35 <213> ORGANISM: Homo sapiens 

37 <4 00> SEQUENCE: 1 

38 atggagactt attccttgtc tttgggtaat caatcagtgg tggaacctaa catagcaata 60 

39 cagtcagcaa atttctcttc agaaaatgcg gtggggcctt caaatgttcg cttctctgtg 120 

40 cagaaaggag ctagcagttc tctagtttct agttcaacat ttatacatac aaatgtggat 180 

41 ggccttaacc cagatgcaca gactgagctt caggtcttgc ttaatatgac gaaaaattac 240 

42 accaagacat gcggctttgt agtttatcaa aatgacaagc ttttccaatc aaaaactttt 300 

43 acagctaaat cggattttag tcaaaaaatt atctcaagca aaactgatga aaatgagcaa 360 

44 gatcagagtg cttctgttga catggtcttt agtccaaagt acaaccaaaa agaatttcaa 420 

45 ctctattcct atgcctgtgt ctattggaat ttgtcagcga aggactggga cacatatggc 480 

46 tgtcaaaaag acaagggcac tgatggattc ctgcgctgcc gctgcaacca tactactaat 540 

47 tttgctgtat taatgacttt caaaaaggat tatcaatatc ccaaatcact tgacatatta 600 
4 8 tccaacgttg gatgtgcact gtctgttact ggtctggctc tcacagttat atttcagatt 660 

49 gtcaccagga aagtcagaaa aacctcagta acctgggttt tggtcaatct gtgcatatca 720 

50 atgttgattt tcaacctcct ctttgtgttt ggaattgaaa actccaataa gaacttgcag 780 

51 acaagtgatg gtgacatcaa taatattgac tttgacaata atgacatacc caggacagac 840 

52 accattaaca tcccgaatcc catgtgcact gcgattgccg ccttactgca ctattttctg 900 

53 ttagtgacat ttacctggaa cgcactcagc gctgcacagc tctattacct tctaataagg 960 

54 accatgaagc ctcttcctcg gcatttcatt cttttcatct cattaattgg atggggagtc 1020 

55 ccagctatag tagtggctat aacagtggga gttatttatt ctcagaatgg aaataatcca 1080 

56 cagtgggaat tagactaccg gcaagagaaa atctgctggc tggcaattcc agaacccaat 1140 

57 ggtgttataa aaagtccgct gttgtggtca ttcatcgtac ctgtaaccat tatcctcatc 1200 

58 agcaatgttg ttatgtttat tacaatctcg atcaaagtgc tgtggaagaa taaccagaac 1260 

59 ctgacaagca caaaaaaagt ttcatccatg aagaagattg ttagcacatt atctgttgca 1320 
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RAW SEQUENCE LISTING DATE . in/ifi/onm 

PATENT APPLICATION: US/09/966,422 Till': 15:52:2^ 

Input Set : A:\30534111.app 

Output Set: N:\CBF3\10162001\I966422.raw 

I HI PS =S 55 S Si; S 

63 ctatcgtcta ttgggagaag gaagtcattg ccttcagtga cgcggccgag gctgca?S ifso 

64 aagatgtata atttcctcag gtcattgcca accttacatg aacgcttSg lion 

65 acctctccga gtactgagga aatcacactc tctgaaagtg" acaa'tgca^ j££S£ ie"o° 

69 <210> SEQ ID NO: 2 1683 

70 <211> LENGTH: 560 

71 <212> TYPE: PRT 

72 <213> ORGANISM: Homo sapiens 

74 <400> SEQUENCE: 2 

75 Met Glu Thr Tyr Ser Leu Ser Leu Gly Asn Gin Ser Val Val Glu Pro 
78 Asn lie Ala lie Gin Ser Ala Asn Phe Ser Ser Glu Asn Ala Val Gly 



25 



81 Pro Ser Asn Val Arg Phe Ser Val Gin Lys Gly Ala Ser Ser Ser Leu 
84 Val ser Ser Ser Thr Phe lie „" Thr Asn Val Asp Gly Leu- Asn Pro 
87 Asp Ala Gin Thr Glu Leu Gin Val Leu Leu Asn Met Thr Lys Asn Tyr 

90 Thr Lys Thr Cys Gly Phe Val Val Tyr Gin As'n Asp Lys Leu Phe Gin 

90 

93 ser Lys Thr Phe Thr Ala Lys Ser Asp Phe Ser Gin Lys He lie Ser 

rZ 0 105 110 

96 Ser Lys Thr Asp Glu Asn Glu Gin Asp Gin ser Ala Ser Val Asp Met 

100 Val ^30 Ser ^ ^ A ^ 5 Gln LyS G1U Phe Gln Tyr Ser Tyr 

102 Ala Cys Val Tyr Trp Asn Leu Ser Ala Lys Asp Trp Asp Thr Tyr Gly 



150 



105 Cys Gin Lys Asp Lys Gly Thr Asp Gly Phe III Arg Cys Arg Cys Asn 



160 



170 



108 His Thr Thr Asn Phe Ala Val Leu Met Thr Phe Lys Lys Asp Tyr Gin 
111 Tyr Pro Lys Ser Leu Asp He Leu Ser Asn Val Gly Cys Ala Leu Ser 



200 



112 195 

114 Val Thr Gly Leu Ala Leu Thr Val" lie Phe Gin He Val" Thr Arg Lys 
117 Val Arg Lys Thr Ser Val Thr Trp Val Leu Val A S n Leu Cys He Ser 
120 Met Leu He Phe Asn Leu Leu Phe Val Phe "y He Glu Asn Ser Asn 



250 



123 Lys Asn Leu Gin Thr Ser Asp Gly Asp lie Asn Asn He Asp Phe' Asp 



265 



126 Asn Asn Asp He Pro Arg Thr Asp Thr He Asn He Pro III Pro Met 



275 



280 



129 Cys Thr Ala He Ala Ala Leu Leu His Tyr Phe Leu l!u Val Thr Phe 
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RAW SEQUENCE LISTING DATE . 

PATENT APPLICATION : US/09/966,422 SSi 15:52:20 

Input Set : A:\30534111.app 

Output Set: N:\CRF3\10162001\l966422.raw 

258 cggcaatcgc agtgcacatg 

261 <210> SEQ ID NO: 6 80 

262 <211> LENGTH: 23 

263 <212> TYPE: DNA 

264 <213> ORGANISM: Artificial Sequence 

266 <220> FEATURE: 

267 <223> OTHER INFORMATION: DescriDtinn o-F a^a-f- • -> „ 

268 oligos description of Artificial Sequence : Synthetic , _ 

270 <400> SEQUENCE: 6 

271 cagacaccat taacatcccg aat 

274 <210> SEQ ID NO : 7 23 

275 <211> LENGTH: 22 

276 <212> TYPE: DNA 

277 <213> ORGANISM: Artificial Sequence 

279 <220> FEATURE: 

280 <223> OTHER^FOKMATION: Description of Artificial Sequence : Synthetic / 

283 <400> SEQUENCE: 7 

284 agaatgaaat gccgaggaag ag 

287 <210> SEQ ID NO : 8 22 
*288 <211> LENGTH: 1230 

289 <212> TYPE: PRT 

290 <213> ORGANISM: RAT 

292 <4 00> SEQUENCE: 8 

293 Met cys Pro Pro Gin Leu Phe lie Leu Met Met Leu Leu Ala Pro Val 
296 Val His Ala Phe Ser Arg Ala Pro lie Pro Met Ala Val Val 'Arg Arg 
299 Glu Leu Ser Cys Glu Ser Tyr Pro ill Glu Leu Arg Cys p ^ Gly Thr 

303 V S G1U S ?S A " a ° ^ *J* ^ ASP AS * **' 

305 lie Cys Asp Ser Asp Pro Ala Gin Met Glu Asn lie Arg Cys Tyr Leu 

308 Pro Asp Ala Tyr Lys He Met Ser Gin Arg Cys" Asn Asn Arg Thr Gin 

311 Cys Ala Val Val Ala Gly Pro Asp Val Phe Pro Asp Pro Cys Pro Gly 

314 Thr Tyr Lys Tyr Leu Glu Val Gin T Jr Glu Cys Val Pro Tyr Lys Val 

317 Glu Gin Lys Val Phe Leu Cys Pro Gly Leu Leu Lys GlJ Val Tyr Gin 

320 Ser Glu His Leu Phe Glu Ser Asp His Gin Ser Gl^ Ala Trp Cys Lys 

323 Asp Pro Leu Gin Ala Ser Asp Lys He Tyr T Jr Met Pro Tr P Thr Pro 

326 Tyr Arg Thr Asp Thr Leu Thr Glu Tyr sir Ser Lys Asp Asp III xie 

329 Ala- Gly Arg Pro Thr Thr Thr Tyr Lys' Leu Pro His Arg 111 Asp Gly 
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VERIFICATION SUMMARY DATE: 10/16/2001 

PATENT APPLICATION: US/09/966, 422 TIME: 15:52:21 

Input Set : A:\30534111.app 

Output Set: N:\CRF3\10162001\I966422.raw 

L:16 M:270 C: Current Application Number differs, Replaced Current Application Number 

L:1336 M:258 W: Mandatory Feature missingj^e^u^^EATURE : a/T^ +*>^/ 

L:1336 M:258 W: Mandatory Feature missin^^^i^^OTHER INFORMATION: Cz^f^rC 

L:2066 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#/ 

L:2066 M:258 W: Mandatory Feature missing, <222> not found for 

L:2066 M:341 W: (46) "n n or "Xaa" used, for SEQ ID#:67 

L:2067 M:258 W: Mandatory Feature missing, <221> not found 

L:2067 M:258 W: Mandatory Feature missing, <222> not found 

L:2067 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:67 

L:2080 M:258 W: Mandatory Feature missing, <221> not found 

L:2080 M:258 W: Mandatory Feature missing, <222> not found 

L:2080 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:68 

L:2081 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:68 

L:2081 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:68 

L:2081 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:68 
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<210> 16 
<211> 41 
<212> PRT 

<213> Artificial Sequence 
<400> 16 




